Dear reader,

This supplementary material is structured as follows:

Paper.pdf
* Full paper with appendix

IRL_Gridworld_Public.ipynb
* All experiments for the gridworld domain (Section 6.1)

LunarLanderContinuous_Public.ipynb
* All experiments for the continuous control domain (Section 6.2)

Sincerely,

The authors
